Comparative Benchmarking of Causal Discovery Techniques

Authors

  • Karamjit Singh
  • Garima Gupta
  • Vartika Tewari
  • Gautam Shroff
Abstract

In this paper we present a comprehensive view of prominent causal discovery algorithms, grouped into two main categories: (1) those assuming acyclicity and no latent variables, and (2) those allowing both cycles and latent variables. We also report experimental results comparing them from three perspectives: (a) structural accuracy, (b) standard predictive accuracy, and (c) accuracy of counterfactual inference. For (b) and (c) we train causal Bayesian networks with the structures predicted by each causal discovery technique and use them for standard predictive or counterfactual inference. We compare the algorithms on two publicly available datasets and one simulated dataset, covering different sample sizes: small, medium and large. Experiments show that higher structural accuracy of a technique does not necessarily translate into higher accuracy on inference tasks. Further, the surveyed structure learning algorithms do not perform well in terms of structural accuracy on datasets with a large number of variables.

arXiv:1708.06246v2 [cs.AI] 12 Sep 2017
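As a rough illustration of what "structural accuracy" can mean when a learned graph is compared against a known ground-truth graph, the sketch below computes edge-level precision, recall and F1, plus a structural Hamming distance. The abstract does not say which metric the authors use, so these metric choices, the function names `structural_scores` and `structural_hamming_distance`, and the toy graphs are all assumptions made for illustration.

```python
from typing import Dict, Iterable, Set, Tuple

Edge = Tuple[str, str]  # directed edge written as (cause, effect)


def structural_scores(true_edges: Iterable[Edge],
                      learned_edges: Iterable[Edge]) -> Dict[str, float]:
    """Edge-level precision, recall and F1 over directed edges.

    Assumption: an edge counts as correct only if both its endpoints
    and its direction match the ground-truth graph.
    """
    true_set: Set[Edge] = set(true_edges)
    learned_set: Set[Edge] = set(learned_edges)
    true_positives = len(true_set & learned_set)
    precision = true_positives / len(learned_set) if learned_set else 0.0
    recall = true_positives / len(true_set) if true_set else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


def structural_hamming_distance(true_edges: Iterable[Edge],
                                learned_edges: Iterable[Edge]) -> int:
    """Count node pairs whose edge status differs between the two graphs.

    Assumption: a missing, extra, or reversed edge each costs 1.
    """
    true_set: Set[Edge] = set(true_edges)
    learned_set: Set[Edge] = set(learned_edges)
    pairs = {frozenset(edge) for edge in true_set | learned_set}
    distance = 0
    for pair in pairs:
        nodes = sorted(pair)
        a, b = nodes[0], nodes[-1]  # a == b for a self-loop
        true_state = ((a, b) in true_set, (b, a) in true_set)
        learned_state = ((a, b) in learned_set, (b, a) in learned_set)
        if true_state != learned_state:
            distance += 1
    return distance


if __name__ == "__main__":
    # Hypothetical toy graphs, not taken from the paper's datasets.
    truth = [("A", "B"), ("B", "C"), ("A", "C")]
    learned = [("A", "B"), ("C", "B"), ("A", "D")]
    print(structural_scores(truth, learned))
    print(structural_hamming_distance(truth, learned))
```

A graph can score well on such metrics and still yield a poorly performing Bayesian network for prediction or counterfactual queries, which is the kind of mismatch the abstract reports.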

Similar resources

A novel feature selection techniques based on contrast set mining

Data classification is a challenging task in era of big data due to high number of features. Feature selection is a step in process of knowledge discovery in data that aims to reduce dimensionality and improve the classification performance. The purpose of this research is to define new techniques for feature selection in order to improve classification accuracy and reduce the time required for...

Benchmarking Link Discovery Systems for Geo-Spatial Data

Linking geo-spatial entities is targeted only by a limited number of link discovery benchmarks. Linking spatial resources requires techniques that differ from the classical, mostly string-based approaches. In particular, considering the topology of the spatial resources and the topological relations between them is of central importance to systems that manage spatial data. Due to the large amou...

A Study of Causal Discovery With Weak Links and Small Samples

Weak causal relationships and small sample size pose two significant difficulties to the automatic discovery of causal models from observational data. This paper examines the influence of weak causal links and varying sample sizes on the discovery of causal models. The experimental results illustrate the effect of larger sample sizes for discovering causal models reliably and the relevance of...

Benchmarking Sustainability with Respect to Transportation Supply and Demand

This paper is an endeavor to quantify the concept of sustainable transportation. The prevailing idea in the context of sustainable development (SD) emphasizes on the reduction of transportation demand in order to reduce the environmental and social consequences of it. Nevertheless, in the current paper using a measure for SD, and based on the conformity of the growths of all sectors with transp...

Towards a Framework for Dependability Benchmarking

The goal of dependability benchmarking is to provide generic ways for characterizing the behavior of components and computer systems in the presence of faults, allowing for the quantification of dependability measures. Beyond existing evaluation techniques, dependability benchmarking must provide a reproducible and cost-effective way of performing this evaluation either as stand alone assessmen...

Journal:
  • CoRR

Volume: abs/1708.06246

Publication date: 2017